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Using a large set of daily US and Japanese stock returns, we test in detail the rele- 
vance of Student models, and of more general elliptical models, for describing the joint 
distribution of returns. We find that while Student copulas provide a good approxima- 
tion for strongly correlated pairs of stocks, systematic discrepancies appear as the linear 
correlation between stocks decreases, that rule out all elliptical models. Intuitively, the 
failure of elliptical models can be traced to the inadequacy of the assumption of a sin- 
gle volatility mode for all stocks. We suggest several ideas of methodological interest to 
efficiently visualise and compare different copulas. We identify the rescaled difference 
with the Gaussian copula and the central value of the copula as strongly discriminating 
observables. We insist on the need to shun away from formal choices of copulas with no 
financial interpretation. 

Keywords: Copulas; Stock returns; Multivariate distribution; Linear correlation; Non- 
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1. Introduction 

The most important input of portfolio risk analysis and portfolio optimisation is 
the correlation matrix of the different assets. In order to diversify away the risk, one 
must avoid allocating on bundles of correlated assets, an information in principle 
contained in the correlation matrix. The seemingly simple mean- variance Markowitz 
program is however well known to be full of thorns. In particular, the empirical de- 
termination of large correlation matrices turns out to be difficult, and some astute 
"cleaning" schemes must be devised before using it for constructing optimal allo- 
cations This topic has been the focus of intense research in the recent years, 
some inspired by Random Matrix Theory (for a review see |13I38| ) or clustering 
ideas |32l42143| . 
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There are however many situations of practical interest where the (linear) cor- 
relation matrix is inadequate or insufficient [1130133] . For example, one could be 
more interested in minimising the probability of large negative returns rather than 
the variance of the portfolio. Another example is the Gamma-risk of option port- 
folios, where the correlation of the squared-returns of the underlyings is needed. 
Credit derivatives are bets on the probability of simultaneous default of companies 
or of individuals; again, an estimate of the correlation of tail events is required (but 
probably very hard to ascertain empirically) |19j , and for a recent interesting review 

Apart from the case of multivariate Gaussian variables, the description of non- 
linear dependence is not reducible to the linear correlation matrix. The general 
problem of parametcrising the full joint probability distribution of N random vari- 
ables can be "factorised" into the specification of all the marginals on the one hand, 
and of the dependence structure (called the 'copula') of N standardised variables 
with uniform distribution in [0, 1], on the other hand. The nearly trivial statement 
that all multivariate distributions can be represented in that way is called Sklar's 
Theorem [1414=1] . Following a typical pattern of mathematical finance, the intro- 
duction of copulas ten years ago has been followed by a calibration spree, with 
academics and financial engineers frantically looking for copulas to best represent 
their pet multivariate problem. But instead of trying to understand the economic 
or financial mechanisms that lead to some particular dependence between assets 
and construct copulas that encode these mechanisms, the methodology has been — 
as is sadly often the case — to brute force calibrate on data copulas straight out 
from statistics handbooks [12115116117136] . The "best" copula is then decided from 
some quality-of-fit criterion, irrespective of whether the copula makes any intuitive 
sense (some examples are given below). This is reminiscent of the 'local volatility 
models' for option markets [llj : although the model makes no intuitive sense and 
cannot describe the actual dynamics of the underlying asset, it is versatile enough 
to allow the calibration of almost any option smile (see [22]). This explains why this 
model is heavily used in the financial industry. Unfortunately, a blind calibration 
of some unwarranted model (even when the fit is perfect) is a recipe for disaster. 
If the underlying reality is not captured by the model, it will most likely derail in 
rough times — a particularly bad feature for risk management! Another way to 
express our point is to use a Bayesian language: there are families of models for 
which the 'prior' likelihood is clearly extremely small — we discuss below the case 
of Archimedean copulas. Statistical tests are not enough — intuition and plausibilty 
are mandatory. 

The aim of this paper is to study in depth the family of elliptical copulas, in 
particular Student copulas, that have been much used in a financial context and 
indeed have a simple intuitive interpretation [14J1 8 23 27 30 40]. We investigate in 
detail whether or not such copulas can faithfully represent the joint distribution of 
the returns of US stocks. (We have also studied other markets as well). We unveil 
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clear, systematic discrepancies between our empirical data and the corresponding 
predictions of elliptical models. These deviations are qualitatively the same for dif- 
ferent tranches of market capitalisations, different time periods and different mar- 
kets. Based on the financial interpretation of elliptical copulas, we argue that such 
discrepancies are actually expected, and propose the ingredients of a generalisa- 
tion of elliptical copulas that should capture adequately non-linear dependences in 
stock markets. The full study of this generalised model, together with a calibration 
procedure and stability tests, will be provided in a subsequent publication. 

The outline of the paper is as follows. We first review different measures of de- 
pendence (non- linear correlations, tail dependence), including copulas. We discuss 
simple ways to visualise the information contained in a given copula. We then in- 
troduce the family of elliptical copulas, with special emphasis on Student copulas 
and log-normal copulas, insisting on their transparent financial interpretation, in 
contrast with the popular but rather implausible Archimedean copulas. We then 
carefully compare our comprehensive empirical data on stock markets with the pre- 
dictions of elliptical models — some of which being parameter free, and conclude 
that elliptical copulas fail to describe the full dependence structure of equity mar- 
kets. 



2. Bivariate measures of dependence and copulas 

In this section, we recall several bivariate measures of dependence, and study their 
properties when the distribution of the random pair under scrutiny is elliptical. The 
incentive to focus on bivariate measures comes from the theoretical property that 
all the marginals, including bivariate, of a multivariate elliptical distribution are 
themselves elliptical. In turn, a motivated statement that the pairwisc distributions 
are not elliptical is enough to claim the non-cllipticity of the joint multivariate 
distribution. This is the basic line of argumentation of the present paper. 



2.1. Correlation coefficients 

Beyond the standard correlation coefficient, one can characterise the dependence 
structure through the correlation between powers of the random variables of inter- 
est: 

(dj = nSiSAXjXtf] - E[g a |X^]E[^|X 3 f ] 

.(dj _ n^xA d ]-nxi\ d ]nM d ] (0 ~, 



where Si = sign(Xi) and provided the variances V in the denominators are well 
defined. The case p {d) for d = 1 corresponds to the usual linear correlation coefficient, 
for which we will use the standard notation p, whereas C (d> for d = 2 is the correlation 
of the squared returns, that would appear in the Gamma-risk of a A-hedged option 
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portfolio. However, high values of d are expected to be very noisy in the presence of 
power-law tails (as is the case for financial returns) and one should seek for lower 
order moments, such as £ (d) for d = 1 which also captures the correlation between 
the amplitudes (or the volatility) of price moves, or even p <s) = p (d) for d = that 
measures the correlation of the signs. 

In the case of bivariate Gaussian variables with zero mean, all these correlations 
can be expressed in terms of p. For example, the quadratic correlation £ <2) = £ <d) 
for d = 2 is given by: 

C (2) = p 2 ; (2.3) 

whereas the correlation of absolute values = £ (d) for d = 1 is given by: 

C« = ^^, (2-4) 

2 1 

with D (r) = s/l — r 2 + r arcsinr. The correlation of signs is given by 
P (s) = § arcsin p. For some other classes of distributions, the higher-order coeffi- 
cients p {d) and C <d) are explicit functions of the coefficient of linear correlation. This 
is for example the case of Student variables (see Fig. [TJ and more generally for all 
elliptical distributions (see below). 



2.2. Tail dependence 

Another characterisation of dependence, of great importance for risk management 
purposes, is the so-called coefficient of tail dependence which measures the joint 
probability of extreme events. More precisely, the upper tail dependence is defined 
as [T4130] : 

r uv (p) = P [X, > V~l ( P ) | X, > V~] (p)] , (2.5) 

where V < ^k is the cumulative distribution function (cdf) of Xk, and p a certain 
probability level. In spite of its seemingly assymetric definition, it is easy to show 
that t uv is in fact symmetric in Xi <H- Xj when all univariate marginals are identical. 
When p — > 1, t* l , = t uv {\) measures the probability that X, takes a very large 
positive value knowing that Xj is also very large, and defines the asymptotic tail 
dependence. Random variables can be strongly dependent from the point of view 
of linear correlations, while being nearly independent in the extremes. For example, 
bivariate Gaussian variables arc such that t* v = for any value of p < 1. The lower 
tail dependence t ll is defined similarly: 

t ll (p) =F[X l < VZl{l - p) | X 3 < V-j(l - p)] , (2.6) 

and is equal to T uu {p) for symmetric bivariate distributions. One can also define 
mixed tail dependence, for example: 

T LU (p) =F[Xi< V-j(l - p) | X S > V-J(p)] , (2.7) 

with obvious interpretations. 
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2.3. Copulas 



There are many other possible coefficients of dependence, such as Spearman's rho 
or Kendall's tau, that measure, respectively, the correlation of ranks or the "con- 
cordance" probability (see e.g. [39] for an introduction). Both these measures are 
invariant under any increasing transformations. More generally, the copula (gener- 
alisable to dimensions larger than 2), encodes all the dependence between random 
variables that is invariant under increasing transformations. The copula of a random 
pair (Xi,Xj) is the joint cdf of E7, = V <t i(Xi) and Uj = V < j(Xj): 

C{u u u 3 ) = P [P <ti (Xi) < m and T <J {X J ) < u 3 ] (2.8) 

Since the marginals of Ui and Uj are uniform by construction, the copula 
only captures their degree of "entanglement" . For independent random variables, 
C{ui, uj) = UiUj. 

Whereas the p (d) 's and ( (d), s depend on the marginal distributions, the 
tail dependences, Spearman's rho and Kendall's tau can be fully expressed 
in terms of the copula only. For example, Kendall's tau is given by r (K) = 
4 Jj j,., C(ui, Uj)dC(Ui, Uj) — 1, while the tail dependence coefficients can be ex- 
pressed as: 

l-2p + C(p,p) C(l-p,l-p) 

: =T uu {p) =T LL {p) (2.9a) 

1 — p 1 — p 

1 -p - cjp, / v 1 -p-C{l-p,p) , . , . 

: = T UL (p) = T LU {p). 2.9b 

1 — p 1 — p 

Copulas are not easy to visualise, first because they need to be represented as 
3-D plot of a surface in two dimensions and second because there are bounds (called 
Frechet bounds [30]) within which C(u, v) is constrained to live, and that compresse 
the difference between different copulas (the situation is even worse in higher di- 
mensions). Estimating copula densities, on the other hand, is even more difficult 
than estimating univariate densities, especially in the tails. We therefore propose to 
focus on the diagonal of the copula, C(p,p) and the anti-diagonal, C(p, 1 — p), that 
capture part of the information contained in the full copula, and can be represented 
as 1-dimensional plots. Furthermore, in order to emphasise the difference with the 
case of independent variables, it is expedient to consider the following quantities: 

C(p,p)-p 2 =Tuu{p ) +Tll{1 - p) _ 1 (2.10a) 
p{l-p) 

C^-ti-rt-P) = 1 - TuL ( P ) - r LU (l - P), (2.10b) 
p{l-p) 

where the normalisation is chosen such that the tail correlations appear naturally. 
Note in particular that the diagonal quantity tends to the asymptotic tail depen- 
dence coefficients r* n (t* l ) when p — > 1 (p — > 0). Similarly, the anti-diagonal tends 
to — t* as p — > 1 and to — t* , as p — > 0. 



June 5, 2012 1:24 WSPC/INSTRUCTION FILE 

Chicheportiche-Bouchaud-2010 



6 R. Chicheportiche and J. -P. Bouchaud 



Another important reference point is the Gaussian copula Cg{u, v), and we will 
consider below the normalised differences along the diagonal and the anti-diagonal: 

a / \ C{p,p) - C G (p,p) C(p,l-p) - C G (p,l-p) 

A d(p) = -r. 7 ; A a (p) = r , (2.11) 

p(l-p) p(l-p) 

which again tend to the asymptotic tail dependence coefficients in the limits p — > 1 
(p —> 0), owing to the fact that these coefficients are all zero in the Gaussian case. 

The centerpoint of the copula, C(i, i), is particularly interesting: it is the prob- 
ability that both variables are simultaneously below their respective median. For 
bivariate Gaussian variables, one can show that: 

C G {\, ±) = \ + i- arcsinp = \ (1 + p<*>) , (2.12) 

where p (a) is the sign correlation coefficient defined above. The trivial but remarkable 
fact is that the above expression holds for any elliptical models, that we define and 
discuss in the next section. This will provide a powerful test to discriminate between 
empirical data and a large class of copulas that have been proposed in the literature. 



3. Review of elliptical models 
3.1. General elliptical models 

Elliptical random variables Xi can be simply generated by multiplying standardised 
Gaussian random variables ej with a common random (strictly positive) factor cr, 
drawn independently from the e's from an arbitrary distribution P(cr): |14j 

A, //, • i = l.. .N (3.1) 

where \Xi is a location parameter (set to zero in the following), and e ~ jV(0, £). Such 
models are called "elliptical" because the corresponding multivariate probability 
distribution function (pdf) depends only on the rotationally invariant quadratic 
form (x — ij,) t Y 1 ~ 1 (x — fj,); therefore quantile levels define ellipses in N dimensions 
(see [4] for the construction and properties of clliptically contoured multivariate 
distributions) . 

In a financial context, elliptical models are basically stochastic volatility models 
with arbitrary dynamics such that the ergodic distribution is P(a) (since we model 
only single-time distributions, the time ordering is irrelevant here). The important 
assumption, however, is that the random amplitude factor a is equal for all stocks 
(or all assets in a more general context). In other words, one assumes that the mech- 
anisms leading to increased levels of volatility affect all individual stocks identically. 
There is a unique market volatility factor. Of course, this is a very restrictive as- 
sumption since one should expect a priori other, sector specific, sources of volatility. 
This, we believe, is the main reason for the discrepancies between elliptical models 
and the empirical data reported below. 

Some of the above measures of dependence can be explicitely computed for 
elliptical models. Introducing the following ratios: fd = E[cr 2d ]/E[<7 d ] 2 , one readily 
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finds: 



•J/2 - 1 

e\p) = h : D f {p) - 1 (3.2b) 

2/1 - 1 

C*(p) = j + ^ arcsinp = - (1 + p (3 ») , (3.2c) 

4 Z7T 4 

where C* = C(|, Note that /2 is related to the kurtosis of the X's through the 
relation k = 3(/a — 1). The calculation of the tail correlation coefficients depends on 
the specific form of P{cr), for which several choices are possible. We will focus in the 
following on two of them, corresponding to the Student model and the log-normal 
model. 

3.2. The Student ensemble 

When the distribution of the square volatility is inverse Gamma, i.e. P(u = a 2 ) oc 
c~u/u 1+ "/ 2 , the joint pdf of the returns turns out to have an explicit form |9|14|30j : 

1 T(^) / xTE-^xY^ , . 

Ux) = —= K ,1' 1 + (3.3) 

u{ ' v^rFditS r(|) \ v ) ' 

This is the multivariate Student distribution with v degrees of freedom for N ran- 
dom variables with dispersion matrix S. Clearly, the marginal distribution of t v [x) 
is itself a Student distribution with a tail exponent equal to v, which is well known to 
describe satisfactorily the univariate pdf of high-frequency returns (from a few min- 
utes to a day or so), with an exponent in the range [3, 5] (see e.g. Refs. {118120137] ^) . 
The multivariate Student model is therefore a rather natural choice; its correspond- 
ing copula defines the Student copula. For N = 2, it is entirely paramctcrised by v 
and the correlation coefficient p — £12/^^11^22. 

The moments of P(a) are easily computed and lead to the following expressions 
for the coefficients fd- 

when v > 2 (resp. v > 4). Note that in the limit v — > 00 at fixed N, the multivariate 
Student distribution boils down to a multivariate Gaussian. The shape of C (1> (p) 
and C (2> (p) for v = 5 is given in Fig. [TJ 

One can explicitely compute the coefficient of tail dependence for Student vari- 
ables, which only depends on v and on the linear correlation coefficient p. By sym- 
metry, one has T uu (p;v 1 p) = T LL (p;is,p) = T UL (p;v,-p) = r LU (p;v, -p). When 
p = 1 — e with e — > 0, one finds: 

2 4 

T un {p;v,p) = T* u (v,p) + P{v,p) ■ e» +C(e-), (3.5) 

where T* v (y,p) and /3(v,p) are coefficients given in |Appcndix A| see also 
Figs. [1] and [2] The notable features of the above results are: 
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linear correlation 
quadratic correlation 
absolute correlation 
tail dependance 



0.0 



Fig. 1. For all elliptical distributions, the different measures of dependence are functions of p. 
Illustration for a pair with a bivariate Student distribution (u = 5). 



The asymptotic tail dependence T* v (y,p) is strictly positive for all p > — 1 
and finite v, and tends to zero in the Gaussian limit v — > oo (see Fig. 2(a) |. 
The intuitive interpretation is quite clear: large events are caused by large 
occasional bursts in volatility. Since the volatility is common to all assets, 
the fact that one return is positive extremely large is enough to infer that 
the volatility is itself large. Therefore that there is a non-zero probability 
t* lj {v, p) that another asset also has a large positive return (except if p = —1 
since in that case the return can only be large and negative!). It is useful 
to note that the value of T* v (v,p) does not depend on the explicit shape 
of -P(a) provided the asymptotic tail of P{a) decays as L(a) / a 1+v , where 
L(a) is a slow function. 

The coefficient /3(i/, p) is also positive, indicating that estimates of t* v (v^ p) 
based on measures of T uv {jp; v, p) at finite quantiles (e.g. p = 0.99) are biased 
upwards. Note that the correction term is rapidly large because e is raised 
to the power 2/v. For example, when v = 4 and p = 0.3, f3 sa 0.263 and 
one expects a first-order correction 0.026 for p = 0.99. This is illustrated in 
Figs. 2(a)| 2(b) The form of the correction term (in e 2 ^ u ) is again valid as 
soon as P(<r) decays asymptotically as L{a) / a 1+u . 

Not only is the correction large, but the accuracy of the first order expansion 
is very bad, since the ratio of the neglected term to the first correction 
is itself ~ e 2 /" . The region where the first term is accurate is therefore 



exponentially small in v — see Fig. 2(b) 



Finally, we plot in Fig. [3] the rescaled difference between the Student copula 
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1 i i i i i r~ 

2 4 6 8 10 12 14 

v 



(a) Tuu vs. v at several thresholds p for bi- 
variate Student variables with p = 0.3. Note 
that t^j V — ¥ when v — > oo, but rapidly 
grows when p ^ 1. 




0.90 0.92 0.94 0.96 0.98 1.00 
P 

(b) tuu vs p for bivariate Student variables 
with p = 0.3 and v = 5: exact curve (plain), 
first order power-law expansion (dashed) and 
simulated series with different lengths T 
(symbols) 



Fig. 2. 



and the Gaussian copula both on the diagonal and on the anti-diagonal, for several 
values of the linear correlation coefficient p and for v = 5. One notices that the 
difference is zero for p = i . as expected from the expression of C* (p) for general 
elliptical models. Away from p = h on the diagonal, the rescaled difference has a 
positive convexity and non-zero limits when p — ¥ and p — ¥ 1, corresponding to t* l 
and t* v . 




— p = -0.1 






p = 




p = 0.2 




p = 0.5 











Fig. 3. Diagonal and anti-diagonal of the Student copula: the quantities A;j(p) and A a (p) defined 
in Eq. d2 . 1 1 ft are plotted versus p for several values of p and fixed v = 5. (The curves are identical 
in the range p G [0,0.5] due to the symmetry p «-» 1— p). 
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3.3. The log-normal ensemble 

If we now choose a = aoe^ with £ ~ Af(0, s) (as suggested by advocates of mul- 
tifractal models I3|28|35j ). the resulting multivariate model defines the log- normal 
ensemble and the log-normal copula. The factors fd are immediately found to be: 

,2 2 

fd = e s with no restrictions on d. The Gaussian case now corresponds to s = 0. 

Although the inverse Gamma and the lognormal distributions are very different, 
it is well known that the tail region of the log-normal is in practice very hard to 
distinguish from a power-law. In fact, one may write: 



5-V i t?=a-( 1 +-«) (3.6) 



where we have introduced a = <j/<jq. The above equation means that for large a a 
lognormal is like a power-law with a slowly varying exponent. If the tail region corre- 
sponds to a € [4, 8] (say), the effective exponent v lies in the range [0.7/s 2 , 1.05/s 2 ]. 
Another way to obtain an approximate dictionary between the Student model and 
the lognormal model is to fix i/ c g(s) such that the coefficient f^ (say) of the two 
models coincide. Numerically, this leads to u c g(s) ~ 2 + 0.5/s 2 , leading tos~ 0.4 
for v = 5. In any case, our main point here is that from an empirical point of view, 
one does not expect striking differences between a Student model and a log-normal 
model — even though from a strict mathematical point of view, the models are very 
different. In particular, the asymptotic tail dependence coefficients t* v or t* l are 
always zero for the log-normal model (but r LL (p) or T ULJ (p) converge exceedingly 
slowly towards zero as p — > I). 



3.4. Pseudo- elliptical generalisation 

In the previous elliptical description of the returns, all stocks are subject to the 
exact same stochastic volatility, what leads to non-linear dependences like tail effects 
and residual higher-order correlations even for p = (see Fig. [1} . In order to be 
able to fine-tune somewhat this dependence due to the common volatility, a simple 
generalisation is to let each stock be influenced by an idiosyncratic volatility, thus 
allowing for a more subtle structure of dependence. More specifically, we writcQ: 

Xi = a* ■ ei i = l...N (3.7) 



a In vectorial form, we collect the individual (yet dependent) stochastic volatilities <Tj on the 
diagonal of a matrix D, and X = De can be decomposed as X = RDAU where U is a random 
vector uniformly distributed on the unit hypcrsphere, the radial component R is a chi-2 random 
variable independent of U with rank(A) degrees of freedom, and A is a matrix with appropriate 
dimensions such that AA^ = S. This description can be contrasted with the one proposed in 
Ref. 1251 under the term "Multi-tail Generalized Elliptical Distribution": X = R(u)AU with R 
now depending on the unit vector u = AU/\ \ AU\ ■ In other words, the description in l|3.7|) provides 
a different radial amplitude for each component, whereas |25l characterises a direction-dependent 
radial part identical for every component. The latter allows for a richer phenomenology than the 
former, but lacks financial intuition. 
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where the Gaussian residuals q have the same joint distribution as before, and are 
independent of the Cj, but we now generalise the definition of the ratios fd- 

Jd E[^]E[o-^] [ ' 

which describe the joint distribution of the volatilities. As an explicit example, we 
consider the natural generalisation of the log- normal model and write <Ji = ooie^, 
withEl £i ~ AT(0, s), and some correlation structure of the £'s: E[£t£j-] = s 2 c ir One 
then finds: 



j2 2 



fd(c) =e dsc . (3.9) 

Within this setting, the generalisation of coefficients Q3.2j) can be straightfor- 
wardly calculated. Denoting by r the correlation coefficient of and ej (now differ- 
ent from p) we find: 

p{r,c)= W)' r (3,i0a) 

<"<->- %ft^ 

r(r . c) = /ffl_I «, 10c) 

C*(r, c) = - + — arcsinr Vc (3.10d) 

4 27T 

These formulas are straightforwardly generalised for arbitrary description of the 
volatilities, using the coefficients fd defined in (|3.8[) instead of the explicit expres- 
sions given by (|3 .9|) . When c is fixed (e.g. for the elliptical case c = 1), p and r 
are proportional, and all measures of non-linear dependences can be expressed as 
a function of p. But this ceases to be true as soon as there exists some non trivial 
structure c in the volatilities. In that case, c and r are "hidden" underlying vari- 
ables, that can only be reconstructed from the knowledge of p, £ (2) , C* , assuming 
of course that the model is accurate. 

Notice that the result on C(\,\) is totally independent of the structure of the 
volatilitie^]. Indeed, what is relevant for the copula at the central point is not the 
amplitude of the returns, but rather the number of +/— events, which is unaffected 
by any multiplicative scale factor as long as the median of the univariate marginals 
is nil. An important consequence of this result is that for all elliptical or pseudo- 
elliptical model, (0 = implies that C*(p = 0) = \. We now turn to empirical data 
to test the above predictions of elliptical models, in particular this last one. 

b A further generalisation that allows for stock dependent "vol of vol" Si is also possible. 

c This property holds even when Ui depends on the sign of e;, which might be useful to model 

the leverage effect that leads to some asymmetry between positive and negative tails, as the data 

suggests. 
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3.5. A word on Archimedean copulas 

In the universe of all possible copulas, a particular family has become increasingly 
popular in finance: that of "Archimedean copulas" . These copulas are defined, for 
bivariate random vectors, as follows [44] : 

C A {u,v) = cj ) - 1 [^u) + ( j ) (v)] 1 (3.11) 

where <f){u) : [0, 1] — > [0, 1] is a function such that 0(1) = and is de- 
creasing and completely monotone. For example, Frank copulas are such that 
4>f(u) = \n[e e — 1] — ln[e — 1] where 9 is a real parameter, or Gumbel copulas, 
such that 4>g(u) = (— lnu) e , < 6 < 1. The asymptotic coefficient of tail depen- 
dence are all zero for Frank copulas, whereas r* ru = 2 — 2 e (and all other zero) for 
the Gumbel copulas. The case of general multivariate copulas is obtained as natural 
generalisation of the above definition. 

One can of course attempt to fit empirical copulas with a specific Archimedean 
copula. By playing enough with the function 0, it is obvious that one will eventually 
reach a reasonable quality of fit. What we take issue with is the complete lack of 
intuitive interpretation or plausible mechanisms to justify why the returns of two 
correlated assets should be described with Archimedean copulas. This is particularly 
clear after recalling how two Archimedean random variables are generated: first, take 
two U[0, 1] random variables s,w. Set w' = K~ 1 (w) with K(x) — x — cf>(x) / 4>' {x) . 
Now, set: 

u = [s<f>(w')] ; v = r X [(1 - s)<j>{w')} , (3.12) 

and finally write X\ = V~l(u) and X2 = r P~ 2 l { v ) to obtain the two Archimedean 
returns with the required marginals V\ and V2 [21 44 . Unless one finds a natural 
economic or microstructural interpretation for such a labyrinthine construction, we 
content that such models should be discarded a priori, for lack of plausibility. In the 
same spirit, one should really wonder why the financial industry has put so much 
faith in Gaussian copulas models to describe correlation between positive default 
times, that were then used to price CDO's and other credit derivatives. We strongly 
advocate the need to build models bottom-up: mechanisms should come before any 
mathematical representation (see also |34| and references therein for a critical view 
on the use of copulas). 

4. Empirical study of the dependence of stock pairs 

4.1. Methodology: what we do and what we do not do 

As the title stresses, the object of this study is to dismiss the elliptical copula as 
description of the multivariate dependence structure of stock returns. Concretely, 
the null- hypothesis "Ho: the joint distribution is elliptical" admits as corollary "all 
pairwise bivariate marginal copulas are elliptical and differ only by their linear 
correlation coefficient " . In other words, all pairs with the same linear correlation 
p are supposed to have identical values of non-linear dependence measures, and 
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this value is predicted by the elliptical model. Focusing the empirical study on the 
pairwise measures of dependence, we will reject Ho by showing that their average 
value over all pairs with a given p is different from the value predicted by elliptical 
models. 

Our methodology differs from usual hypothesis testing using statistical tools 
and goodness of fit tests, as can be encountered for example in [29 a for testing the 
Gaussian copula hypothesis on financial assets. Indeed, the results of such tests are 
often not valid because financial time series are persistent (although almost not lin- 
early autocorrelated) , so that successive realisations cannot be seen as independent 
draws of an underlying distribution, see the recent discussion on this issue in [5] . 

4.2. Data set and time dependence 

The dataset we considered is composed of daily returns of 1500 stocks labeled in 
USD in the period 1995-2009 (15 full years). We have cut the full period in three 
subperiods (1995-1999, 2000-2004 and 2005-2009), and also the stock universe into 
three pools (large caps, mid-caps and small caps). We have furthermore extended 
our analysis to the Japanese stock markets. Qualitatively, our main conclusions arc 
robust and do not depend neither on the period, nor on the capitalisation or the 
market. Some results, on the other hand, do depend on the time period, but we never 
found any strong dependence on the capitalisation. All measures of dependence are 
calculated pairwise with non-parametric estimators, and using all trading dates 
shared by both equities in the pair. 

We first show the evolution of the linear correlation coefficients (Fig. HJ and the 
upper and lower tail dependence coefficients for p = 0.95 (Fig. [5]) as a function of 
time for the large-cap stocks. One notes that (a) the average linear correlation ~p(t) 
fluctuates quite a bit, from around 0.1 in the mid-nineties to around 0.5 during the 
2008 crisis; (b) the distribution of correlation coefficients shifts nearly rigidly around 
the moving average value ~p(t); (c) there is a marked, time dependent asymmetry 
between the average upper and lower tail dependence coefficients; (d) overall, the 
tail dependence tracks the behaviour of ~p(t). More precisely, we show the time 
behaviour of the tail dependence assuming either a Gaussian underlying copula or 
a Student (y = 5) copula with the same average correlation ~p{t). Note that the 
former model works quite well when ~p{t) is small, whereas the Student model fares 
better when ~p{t) is large. We will repeatedly come back to this point below. 

We computed the quadratic £ (2) and absolute correlation coefficients for 
each pair in the pool, as well as the whole rescaled diagonal and anti-diagonal of 
the empirical copulas (this includes the coefficients of tail dependenc^l, see Sect. 12.31 
above). We then average these observables over all pairs with a given linear coef- 
ficient p, within bins of varying width in order to take account of the frequency 



d The empirical relative bias for p = 0.95 is < 1% even for series with only 10 3 points — typically 
daily returns over 4 years. 
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of observations in each bin. We show in Fig. [6] and t vu (0.95), r LL (0.95) as a 
function of p for all stocks in the period 2000-2004, together with the prediction 
of the Student copula model for various values of v, including the Gaussian case 
v = oo. For both quantities, we see that the empirical curves systematically cross 
the Student predictions, looking more Gaussian for small p's and compatible with 
Student v w 6 for large p's, echoing the effect noticed in Fig. [5] above. The same 
effect would appear if one compared with the log-normal copula: elliptical models 
imply a residual dependence due to the common volatility, even when the linear 
correlation goes to zero. This property is not observed empirically, since we rather 
see that higher-order correlations almost vanish together with p. The assumption 
that a common volatility factor affects all stocks is therefore too strong, although 
it seems to make sense for pairs of stocks with a sufficiently large linear correlation. 
This result is in fact quite intuitive and we will expand on this idea in Sect. 14.41 
below. 

The above discrepancies with the predictions of elliptical models are qualita- 
tively the same for all periods, market caps and is also found for the Japan data 
set. Besides, elliptical models predict a symmetry between upper and lower tails, 
whereas the data suggest that the tail dependence is asymmetric. Although most 
of the time the lower tail dependence is stronger, there are periods (such as 2002) 
when the asymmetry is reversed. 



4.3. Copula diagonals 

As argued in Sect. 12.31 above, it is convenient to visualise the rescaled difference 
Ad,a(p) between the empirical copula and the Gaussian copula, along the diagonal 
(p,p) and the anti-diagonal (p, 1 — p), see Eqs. (|2.11j) . The central point p = | 
is special; since Cg{\,\) = j, A</ ja (i) are both equal to AC* — 1, that takes a 
universal value for all elliptical models, given by Eq. (|3.2cl) . 

We show in Fig. [5] the diagonal and anti-diagonal copulas for all pairs of stocks 
in the period 2000-2004, and for various values of p. We also show the prediction of 
the Student v = 5 model and of a Frank copula model with Student v = 5 marginals 
and the appropriate value of p. What is very striking in the data is that Ad(p) is 
concave for small p's and becomes convex for large p's, whereas the Student copula 
diagonal is always convex. This trend is observed for all periods, and all caps, and 
is qualitatively very similar in Japan as well for all periods between 1991 and 2009. 
We again find that the Student copula is a reasonable representation of the data 
only for large enough p. The Frank copula is always a very bad approximation - 
see the wrong curvature along the diagonal and the inaccurate behaviour along the 
anti-diagonal. 

Let us now turn to the central point of the copula, C* = C(i, |). We plot in 
Fig. [7] the quantity — cos (2irC*) — p as a function of p, which should be zero for 
all elliptical models, according to Eq. (|3.2cp . The data here include the 284 equities 
constantly member of the S&P500 index in the period 2000-2009. 
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We again find a clear systematic discrepancy, that becomes stronger for smaller 
p's: the empirical value of C* is too large compared with the elliptical prediction. 
In particular, for stocks with zero linear correlation (p = 0), we find C* > |, i.e. 
even when two stocks are uncorrelated, the probability that they move in the same 
directional is larger than j . The bias shown in Fig. [7] is again found for all periods 
and all market caps, and for Japanese stocks as well. Only the amplitude of the 
effect is seen to change across the different data sets. 

Statistical errors in each bin are difficult to estimate since pairs containing the 
same asset are mechanically correlated. In order to ascertain the significance of the 
previous finding, we have compared the empirical value of C* (p) with the result of a 
numerical simulation, where we generate time series of Student (y = 5) returns using 
the empirical correlation matrix. In this case, the expected result is that all pairs 
with equal correlation have the same bivariate copula and thus the same C* so that 
the dispersion of the results gives an estimate of measurement noise. We find that, 
as expected, C*(p) is compatible with Eq. (|3.2cp . at variance with empirical results. 
We also find that the dispersion of the empirical points is significantly larger than 
that of the simulated elliptical pairs with identical linear correlations, suggesting 
that all pairs cannot be described by the same bivariate copula (and definitively not 
an elliptical copula, as argued above). 

All these observations, and in particular the last one, clearly indicate that Stu- 
dent copulas, or any elliptical copulas, are inadequate to represent the full depen- 
dence structure in stock markets. Because this class of copulas has a transparent 
interpretation, we in fact know why this is the case: assuming a common random 
volatility factor for all stocks is oversimplified. This hypothesis is indeed only plau- 
sible for sufficiently correlated stocks, in agreement with the set of observations we 
made above. As a first step to relax this hypothesis, we now turn to the pseudo- 
elliptical log-normal model, that allows further insights but still has unrecoverable 
failures. 

4.4. Pseudo- elliptical log-normal model 

As we just showed, the fact that the central value of the copula C* does not obey 
the relation — cos (2irC*) = p rules out all elliptical models. One possible way out is 
to consider a model where random volatilities are stock dependent, as explained in 
Sect. 13.41 Choosing for simplicity a log-normal model of correlated volatilities and 
inserting (|3.9[) into (|3.10[) , the new prediction is: 

p=-e s2( - c - 1) cos(2nC*), (4.1) 

where c is the correlation of the log-volatilities. This suggests to plot 
z = In (p/\ cos (27rC*) I) as a function of p, as shown in Fig. |7(b)[ For a purely 

e A more correct statement is that both stocks have returns below their median with probability 
larger than ^. However, the median of the distributions are very close to zero, justifying our slight 
abuse of language. 
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elliptical model, z should be identically zero, corresponding to perfectly correlated 
volatilities (c = 1). What we observe in Fig. |7(b)] on the other hand, is that c is in- 
deed close to unity for large enough p's, but systematically decreases as p decreases; 
in other words, the volatilities of weakly correlated stocks are themselves weakly 
correlated. This is, again, in line with the conclusion we reached above. 

However, this pseudo-elliptical model still predicts that C* = \ for stocks with 



zero linear correlations, in disagreement with the data shown in Fig. 7(a) This 
translates, in Fig. |7(b)| into a negative divergence of z when p — > 0. This finding 
means that we should look for other types of constructions. How can one have at 
the same time p = and C* > \ ? A toy-model, that serves as the basis for 
a much richer model that we will report elsewhere [6], is the following. Consider 
two independent, symmetrically distributed random factors ip\ and ip2 with equal 
volatilities, and construct the returns of assets 1 and 2 as: 

X lj2 = Vi±V2- (4.2) 

Clearly, the linear correlation is zero. Using a cumulant expansion, one easily finds 
that to the kurtosis order, the central value of the copula is equal to: 

c> = i + ^ + '- <«> 

Therefore, if the kurtosis k\ of the factor to which both stocks are positively exposed 
is smaller than the kurtosis K2 of the spread, one does indeed find C*(p = 0) > j. 

We will investigate in a subsequent paper [5] additive models such as the above 
one, with random volatilities affecting the factors ipi and ip2, that generalise ellip- 
tical models in a way to capture the presence of several volatility modes. 



5. Conclusion 

The object of this paper was to discuss the adequacy of Student copulas, or more 
generally elliptical copulas, to describe the multivariate distribution of stock returns. 
We have suggested several ideas of methodological interest to efficiently visualise 
and compare different copulas. We recommend in particular the rescaled difference 
with the Gaussian copula along the diagonal and the central value of the copula 
as strongly discriminating observables. We have studied the dependence of these 
quantities, as well as other non-linear correlation coefficients, with the linear corre- 
lation coefficient p, and made explicit the predictions of elliptical models that can 
be empirically tested. We have shown, using a very large data set, with the daily 
returns of 1500 US stocks over 15 years, that elliptical models fail to capture the 
detailed structure of stock dependences. 

In a nutshell, the main message elicited by our analysis is that Student cop- 
ulas provide a good approximation to describe the joint distribution of strongly 
correlated pairs of stocks, but badly miss their target for weakly correlated stocks. 
We believe that the same results hold for a wider class of assets: it is plausible 
that highly correlated assets do indeed share the same risk factor. Intuitively, the 
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failure of elliptical models to describe weakly correlated assets can be traced to 
the inadequacy of the assumption of a single "market" volatility mode. We expect 
that exactly as for returns, several factors are needed to capture sectorial volatility 
modes and idiosyncratic modes as well. The precise way to encode this idea into 
a workable model that would naturally generalise elliptical models and faithfully 
capture the non-linear, asymmetric dependence in stock markets is at this stage 
an open problem, on which we will report in an upcoming paper. We strongly be- 
lieve that such a quest cannot be based on a formal construction of mathematically 
convenient models (such as Archimedean copulas that, in our opinion, cannot be 
relevant to describe asset returns). The way forward is to rely on intuition and plau- 
sibility and come up with models that make financial sense. This is, of course, not 
restricted to copula modeling, but applies to all quarters of quantitative finance. 



Appendix A. Proof of Eqn. (|3.5[) , page [7] 

Lemma Appendix A.l. Let (X\,X2) follow a bivariate student distribution with 
v degrees of freedom and correlation p. Denote by T v (x) the univariate Student cdf 
with v degrees of freedom, and x p = T" 1 ^) its p-quantile, and define 



K 



Hp) 

Then, 

¥[K < k(p) | X 2 = x p ] = T v+1 (k(p)) 

Proof. The proof precedes straightforwardly by showing that t u+ i(k) = 
T~ > x 1 \x 2 ( x i\ x 2)%r when x 2 = x p . The particular result for the limit case p = 1 
is stated in [14]. □ 

Theorem Appendix A.l. Let {X\,X2j be defined like in the Lemma above. 
The pre- asymptotic behaviour of its tail dependence when p 1 is approximated by 
the following expansion in (rational) powers of (1 — p): 

Tuu(p) = t* uv {v, P ) + p)-(l- p)i + 0((1 - p)i) 



- 1 X 1 - px p 



^v + l^/T~p~ 
VTTp~ 



Mi) 



(A.la) 
(A.lb) 



f This result was simultaneously found by Manner and Segers 1311 (in a somewhat more general 
context). We still sketch our proof because it follows a different route (uses the Copula), and the 
final expression looks quite different, although of course numerically identical. 
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with 

T* vv {v,p) = 2-2T v+1 (k{l)) (A.2) 
-+i 

p{v,p) = ^—k{l)t v+1 (k{l))LZ i (A.3) 

V 

[n^)J 1 + 1 vrr-p ' tv+1 { vr+-p 

Proof. 

Recall from Eq. ((231) that 



Tm(p) _ 2 - C '")-f'"> =2-—!- /' (A.4, 
P - 1 1 - P Jp dp 

One easily shows that 

dC(p,p) =2P[Xi <i p |I 2 = a; p ]dp = 2T u+1 {k{p))dp 

where the second equality holds in virtue of the aforementioned lemma. 
Now, for p close to 1, k(p) = fc(l) (l - + ©(x" 4 ), and 

= 2T l/+1 (ft(p)) a 2T„ +1 (ft(l)) - 2(fc(l) - k(p)) ■ t v+1 (k(l)) 

But since the Student distribution behaves as a power-law precisely in the region 
p < 1, we write t v (x) ~ L v jx v+1 and immediately get 

l-p=^x 7 with L v = ^=v* 

The result follows by collecting all the terms and performing the integration in 
(TOl) . □ 

Notice that for large v, the exponent is almost zero and the correction term is 
of order 0(1), and the expansion ceases to hold. This is particularly true for the 
Gaussian distribution (y = oo) for which the behaviour is radically different at 
the limit p —> 1 (where there's strictly no tail correlation) and at p < 1 where a 



dependence subsists, see Fig. 2(a) 
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Fig. 4. Time evolution of the coefficient of linear correlation, computed for the stocks in the 
S&P500 index, with an exponentially moving average of 125 days from January 1990 to December 
2009. Five quantilcs of the p distribution are plotted according to the legend, showing that this 
distribution moves quasi-rigidly around its median value. 
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Fig. 5. Time evolution of the coefficient of tail dependence at p = 0.95, computed for the stocks in 
the S&P500 index. A sliding flat window of 250 days moves monthly (25 days) from January 1990 
to December 2009. We depict in turquoise and red the predictions for Student pair with v = 5 
and v = oo respectively, based on the evolution of the mean value p{t) of the cross-sectional linear 
correlation (dashed) or averaged over the full instantaneous distribution of p(t) (plain), with little 
difference. 
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Fig. 6. Empirical (2000-2004) and elliptical. The fact that the empirical points "cross" the theo- 
retical lines corresponding to elliptical models reveals that there is a dependence structure in the 
volatility, more complex than what is suggested by a single common stochastic scale factor. 
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Fig. 7. Left: — (cos(27rC*) + p) versus the coefficient p of linear correlation. The grey cloud 
is a scatter plot of all pairs of stocks. Black dots correspond to the average value of empirical 
measurements within a correlation bin, and red crosses correspond to the numerically generated 
elliptical (Student, v = 5) data, compatible with the expected value zero. Statistical error are 
not shown, but their amplitudes is of the order of the fluctuations of the red crosses, so that 
empirics and elliptical prediction don't overlap for p < 0.4. This representation allows to visualise 
very clearly the systematic discrepancies for small values of p, beyond the average value, fnset: 
the f s.d. dispersion of the scattered points inside the bins. Right: z = In (p/| cos (27rC*) |) as a 
function of p, with the same data and the same conventions. This shows more directly that the 
volatility correlations of weakly correlated stocks is overestimated by elliptical models, whereas 
strongly correlated stocks are compatible with the elliptical assumption of a common volatility 
factor. 
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Fig. 8. Diagonal (left) and anti-diagonal (right) of the copula vs p. Real data (2000-2004) averaged 
in 10 bins of p. The vertical lines corresponds to 1-sigma dispersion of the results (i.e. not the 
statistical error bar). The turquoise line corresponds to an elliptical model (Student with v = 5): 
it predicts too low a value at the centre point and too large tail dependences (left and right limits). 
The blue line depicts the behaviour of Frank's copula. It is worth noticing that the tail dependence 
coefficient is always measured empirically at some p < 1, for example p = 0.95. In this case, the 
disagreement between data and an elliptical model may be accidentally small. It is important to 
distinguish the interpretation of the limit coefficient from that of a penultimate value [6]. 



